AMENDMENTS 



In the Specification: 

Please replace the entire section entitled "Brief Description of the Drawings" on page 3,
lines 6 through 24, with the following:



BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows the sequence of HCV protease (SEQ ID NO: 69 and SEQ ID NO: 70).

Figure 2 shows the polynucleotide sequence and deduced amino acid sequence of the
clone C20c (SEQ ID NO: 71 and SEQ ID NO: 72).

Figure 3 shows the polynucleotide sequence and deduced amino acid sequence of the
clone C26d (SEQ ID NO: 73 and SEQ ID NO: 74).

Figure 4 shows the polynucleotide sequence and deduced amino acid sequence of the
clone C8h (SEQ ID NO: 75 and SEQ ID NO: 76).

Figure 5 shows the polynucleotide sequence and deduced amino acid sequence of the
clone C7f (SEQ ID NO: 77 and SEQ ID NO: 78).

Figure 6 shows the polynucleotide sequence and deduced amino acid sequence of the
clone C31 (SEQ ID NO: 79 and SEQ ID NO: 80).

Figure 7 shows the polynucleotide sequence and deduced amino acid sequence of the
clone C35 (SEQ ID NO: 81 and SEQ ID NO: 82).

Figure 8 shows the polynucleotide sequence and deduced amino acid sequence of the
clone C33c (SEQ ID NO: 83 and SEQ ID NO: 84).

Figure 9 schematically illustrates assembly of the vector C7fC20cC300C200.

Figure 10 shows the sequence of vector cf1SODp600 (SEQ ID NO: 85 and SEQ ID NO: 86).

Please replace the paragraph starting on page 6, line 23 with the following:

The term "HCV protease" refers to an enzyme derived from HCV which exhibits
proteolytic activity, specifically the polypeptide encoded in the NS3 domain of the HCV
genome. At least one strain of HCV contains a protease believed to be substantially encoded by
or within the following sequence:



Arg Arg Gly Arg Glu Ile Leu Leu Gly Pro
Ala Asp Gly Met Val Ser Lys Gly Trp Arg
Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln
Gln Thr Arg Gly Leu Leu Gly Cys Ile Ile
Thr Ser Leu Thr Gly Arg Asp Lys Asn Gln
Val Glu Gly Glu Val Gln Ile Val Ser Thr
Ala Ala Gln Thr Phe Leu Ala Thr Cys Ile
Asn Gly Val Cys Trp Thr Val Tyr His Gly
Ala Gly Thr Arg Thr Ile Ala Ser Pro Lys
Gly Pro Val Ile Gln Met Tyr Thr Asn Val
Asp Gln Asp Leu Val Gly Trp Pro Ala Ser
Gln Gly Thr Arg Ser Leu Thr Pro Cys Thr
Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr
Arg His Ala Asp Val Ile Pro Val Arg Arg
Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser
Pro Arg Pro Ile Ser Tyr Leu Lys Gly Ser
Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly
His Ala Val Gly Ile Phe Arg Ala Ala Val
Cys Thr Arg Gly Val Ala Lys Ala Val Asp
Phe Ile Pro Val Glu Asn Leu Glu Thr Thr
Met Arg (SEQ ID NO: 1)
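
The tables in the amendments that follow quote short stretches of protease sequence in one-letter code, so it can help to convert between the two notations. The following is a minimal sketch and not part of the specification: the helper name `to_one_letter` is hypothetical, and the two excerpts are single rows of the listing above (the rows beginning "Asn Gly Val" and "Cys Gly Ser").

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_seq: str) -> str:
    """Convert a whitespace-separated three-letter sequence to one-letter codes.

    Hypothetical helper for illustration; raises KeyError on an unknown code.
    """
    return "".join(THREE_TO_ONE[res] for res in three_letter_seq.split())


# Two rows copied from the SEQ ID NO: 1 listing (illustrative excerpts only).
his_row = to_one_letter("Asn Gly Val Cys Trp Thr Val Tyr His Gly")
asp_row = to_one_letter("Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr")

print(his_row)  # one-letter form of the row containing the active-site His
print(asp_row)  # one-letter form of the row containing the active-site Asp
```

The first row contains the stretch TVYHG and the second contains SSDLYLV, which are the His and Asp motifs quoted for HCV in Table 2.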

Please replace the paragraph on page 8, starting on line 18 with the following:



TABLE 1: Alignment of Active Residues by Sequence

Protease         His                        Asp                        Ser
HCV              CWTVYHGAG (SEQ ID NO:2)    DQDLGWPAP (SEQ ID NO:3)    LKGSSGGPL (SEQ ID NO:4)
Yellow Fever     FHTMWHVTR (SEQ ID NO:5)    KEDLVAYGG (SEQ ID NO:6)    PSGTSGSPI (SEQ ID NO:7)
West Nile Fever  FHTLWHTTK (SEQ ID NO:8)    KEDRLCYGG (SEQ ID NO:9)    PTGTSGSPI (SEQ ID NO:10)
Murray Valley    FHTLWHTTR (SEQ ID NO:11)   KEDRVTYGG (SEQ ID NO:12)   PIGTSGSPI (SEQ ID NO:13)
Kunjin Virus     FHTLWHTTK (SEQ ID NO:14)   KEDRLCYGG (SEQ ID NO:15)   PTGTSGSPI (SEQ ID NO:16)



Please replace the paragraph on page 9, starting on line 9 with the following: 



TABLE 2: Alignment of Active Residues by Structure

Protease          His                    Asp                      Ser
S. griseus A      TAGHC (SEQ ID NO:17)   NNDYGII (SEQ ID NO:18)   GDSGGSL (SEQ ID NO:19)
α-Lytic protease  TAGHC (SEQ ID NO:20)   GNDRAWV (SEQ ID NO:21)   GDSGGSW (SEQ ID NO:22)
Bovine Trypsin    SAAHC (SEQ ID NO:23)   NNDIMLI (SEQ ID NO:24)   GDSGGPV (SEQ ID NO:25)
Chymotrypsin      TAAHC (SEQ ID NO:26)   NNDITLL (SEQ ID NO:27)   GDSGGPL (SEQ ID NO:28)
Elastase          TAAHC (SEQ ID NO:29)   GYDIALL (SEQ ID NO:30)   GDSGGPL (SEQ ID NO:31)
HCV               TVYHG (SEQ ID NO:32)   SSDLYLV (SEQ ID NO:33)   GSSGGPL (SEQ ID NO:34)
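
As a rough illustration of how closely the HCV motif around the active-site Ser resembles the other serine proteases in Table 2, the sketch below counts identical positions between equal-length motifs. This is only a position-by-position identity count chosen for illustration. It is not the structural alignment procedure behind the table, and the function name `identity` is hypothetical.

```python
def identity(a: str, b: str) -> int:
    """Count positions at which two equal-length motifs carry the same residue."""
    if len(a) != len(b):
        raise ValueError("motifs must have equal length")
    return sum(x == y for x, y in zip(a, b))


# Ser-column motifs as listed in Table 2 (one-letter code).
SER_MOTIFS = {
    "S. griseus A": "GDSGGSL",
    "alpha-Lytic protease": "GDSGGSW",
    "Bovine Trypsin": "GDSGGPV",
    "Chymotrypsin": "GDSGGPL",
    "Elastase": "GDSGGPL",
}
HCV_SER = "GSSGGPL"

for name, motif in SER_MOTIFS.items():
    print(f"{name}: {identity(HCV_SER, motif)}/{len(motif)} positions identical to HCV")
```

With the motifs as listed, chymotrypsin and elastase each share six of seven positions with the HCV motif, differing only at the second residue.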



Please replace the paragraph on page 13 starting at line 11 with the following:
A presently preferred expression system employs the ubiquitin leader as the fusion 
partner. Copending application USSN 7/390,599 filed 7 August 1989 disclosed vectors for high 
expression of yeast ubiquitin fusion proteins. Yeast ubiquitin provides a 76 amino acid 
polypeptide which is automatically cleaved from the fused protein upon expression. The 
ubiquitin amino acid sequence is as follows: 



Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr
Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys
Ser Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln
Gln Arg Leu Ile Phe Ala Gly Lys Gln Leu Glu Asp Gly
Arg Thr Leu Ser Asp Tyr Asn Ile Gln Lys Glu Ser Thr
Leu His Leu Val Leu Arg Leu Arg Gly Gly (SEQ ID NO:35)

Please replace the paragraph on page 21 starting at line 4 with the following: 
In the absence of this protease activity, the HCV polyprotein should remain in its 
unprocessed form, and thus render the virus noninfectious. Thus, the protein is useful for 
assaying pharmaceutical agents for control of HCV, as compounds which inhibit the protease 
activity sufficiently will also inhibit viral infectivity. Such inhibitors may take the form of 
organic compounds, particularly compounds which mimic the cleavage site of HCV recognized 
by the protease. Three of the putative cleavage sites of the HCV polyprotein have the following 
amino acid sequences: 

Val-Ser-Ala-Arg-Arg // Gly-Arg-Glu-Ile-Leu-Leu-Gly (SEQ ID NO:36) 
Ala-Ile-Leu-Arg-Arg // His-Val-Gly-Pro- (SEQ ID NO:88)
Val-Ser-Cys-Gln-Arg // Gly-Tyr- (SEQ ID NO:89) 



Please replace the paragraph on page 24 starting at line 16 with the following: 
First, DNA isolated from pSODCF1 was treated with BamHI and EcoRI, and the
following linker was ligated into the linear DNA created by the restriction enzymes:

GAT CCT GGA ATT CTG ATA AGA CCT TAA GAC TAT TTT AA (SEQ ID NO:37)






Please replace the paragraph on page 25 starting on line 1 with the following: 



Three separate expression vectors, pcf1AB, pcf1CD, and pcf1EF were created by
ligating three new linkers, AB, CD, and EF to a BamHI-EcoRI fragment derived by digesting
to completion the vector pSODCF1 with EcoRI and BamHI, followed by treatment with
alkaline phosphatase. The linkers were created from six oligomers, A, B, C, D, E, and F.
Each oligomer was phosphorylated by treatment with kinase in the presence of ATP prior to
annealing to its complementary oligomer. The sequences of the synthetic linkers were the
following:

Name   DNA Sequence (5' to 3')

A      GATC CTG AAT TCC TGA TAA (SEQ ID NO:38)
B      GAC TTA AGG ACT ATT TTA A (SEQ ID NO:39)

C      GATC CGA ATT CTG TGA TAA (SEQ ID NO:40)
D      GCT TAA GAC ACT ATT TTA A (SEQ ID NO:41)

E      GATC CTG GAA TTC TGA TAA (SEQ ID NO:42)
F      GAC CTT AAG ACT ATT TTA A (SEQ ID NO:43)
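
The pairing of each oligomer with its partner can be checked mechanically. The sketch below is an illustration under two stated assumptions: that each single-stranded end is four nucleotides long, and that the second oligomer of each pair is printed left to right so that it lines up base by base beneath the first. The function name `pairs_as_duplex` is hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Oligomer pairs as listed above, with spaces removed.
LINKERS = {
    "AB": ("GATCCTGAATTCCTGATAA", "GACTTAAGGACTATTTTAA"),
    "CD": ("GATCCGAATTCTGTGATAA", "GCTTAAGACACTATTTTAA"),
    "EF": ("GATCCTGGAATTCTGATAA", "GACCTTAAGACTATTTTAA"),
}
OVERHANG = 4  # assumed length of each single-stranded end


def pairs_as_duplex(top: str, bottom: str, overhang: int = OVERHANG) -> bool:
    """Return True if `bottom`, read as printed, is the base-by-base complement
    of `top` after skipping the leading overhang of `top` and the trailing
    overhang of `bottom`."""
    paired_top = top[overhang:]
    paired_bottom = bottom[: len(bottom) - overhang]
    return paired_top.translate(COMPLEMENT) == paired_bottom


for name, (top, bottom) in LINKERS.items():
    # Also report where the EcoRI recognition sequence GAATTC starts in the top strand.
    print(name, pairs_as_duplex(top, bottom), top.find("GAATTC"))
```

Under these assumptions all three pairs anneal as printed. The EcoRI recognition sequence begins at a different offset in each upper oligomer (0-based positions 6, 5, and 7 for A, C, and E).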



Please replace the paragraph on page 27 starting on line 2 with the following: 



Clone C33c was isolated using a hybridization probe having the following sequence: 
5' ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT 3' (SEQ ID NO:44)



Please replace the paragraph on page 27 starting on line 7 with the following: 



Clone 35 was isolated by screening with a synthetic polynucleotide having the sequence: 
5' AAG CCA CCG TGT GCG CTA GGG CTC AAG CCC 3' (SEQ ID NO:45)



Please replace the paragraph on page 27 starting on line 20 with the following: 



Clone 7f was isolated using a probe having the sequence: 
5'-AGC AGA CAA GGG GCC TCC TAG GGT GCA TAA T-3' (SEQ ID NO:46)



Please replace the paragraph on page 27 starting on line 24 with the following: 



Clone C20c is isolated using a probe having the following sequence: 
5'-TGC ATC AAT GGG GTG TGC TGG-3' (SEQ ID NO:47) 
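
To see which peptide stretch a probe of this kind corresponds to, it can be translated in frame. The sketch below is illustrative only: it assumes the probe is read in frame from its first nucleotide and uses the standard genetic code. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame from its first base (spaces ignored).

    Any trailing bases that do not fill a complete codon are dropped.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


probe_c20c = "TGC ATC AAT GGG GTG TGC TGG"  # SEQ ID NO:47, as given above
print(translate(probe_c20c))
```

Read this way the probe encodes Cys-Ile-Asn-Gly-Val-Cys-Trp, a stretch that also occurs in the protease sequence of SEQ ID NO: 1.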



Please replace the paragraph on page 28 starting on line 6 with the following: 

Clone 8h was isolated using a probe based on the sequence of nucleotides in clone 33c.
The nucleotide sequence of the probe was

5'-AGA GAC AAC CAT GAG GTC CCC GGT GTT C-3' (SEQ ID NO:48)



Please replace the paragraph on page 28 starting on line 11 with the following:
Clone C26d is isolated using a probe having the following sequence: 
5'-CTG TTG TGC CCC GCG GCA GCC-3' (SEQ ID NO:49) 



Please replace the paragraph on page 30 starting on line 8 with the following: 



This vector contains a full-length HCV protease coding sequence fused to the FLAG
sequence, Hopp et al. (1988) Biotechnology 6: 1204-1210. PCR was used to produce an HCV
protease gene with special restriction ends for cloning ease. Plasmid p500 was digested with
EcoRI and NdeI to yield a 900 bp fragment. This fragment and two primers were used in a
polymerase chain reaction to introduce a unique BglII site at amino acid 1009 and a stop codon
with a SalI site at amino acid 1262 of the HCV-1, as shown in Figure 17 of WO 90/11089,
published 4 October 1990. The sequence of the primers is as follows:

5' CCC GAG CAA GAT CTC CCG GCC C 3' (SEQ ID NO:50)
and
5' CCC GGC TGC ATA AGC AGT CGA CTT GGA 3' (SEQ ID NO:51)

After 30 cycles of PCR, the reaction was digested with BglII and SalI, and the 710 bp fragment
was isolated. This fragment was annealed and ligated to the following duplex:

MetAspTyrLysAspAspAspAspLysGlyArgGlu
CATGGACTACAAAGACGATGACGATAAAGGCCGGGAG (SEQ ID NO: 52)
    CTGATGTTTCTGCTACTGCTATTTCCGGCCCTCTAG (SEQ ID NO: 90)

The duplex encodes the FLAG sequence, an initiator methionine, and a 5' NcoI restriction site.
The resulting NcoI/SalI fragment was ligated into a derivative of pCFL
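
The restriction sites that the two PCR primers are said to introduce can be located by a simple string search. This is a minimal sketch: the recognition sequences used are the standard ones for BglII (AGATCT) and SalI (GTCGAC), and the function name `find_sites` is hypothetical.

```python
# Standard recognition sequences for the two enzymes named in the text.
SITES = {"BglII": "AGATCT", "SalI": "GTCGAC"}


def find_sites(seq: str) -> dict:
    """Return {enzyme: 0-based start index} for each recognition site present."""
    seq = seq.replace(" ", "").upper()
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


primer_50 = "CCC GAG CAA GAT CTC CCG GCC C"        # SEQ ID NO:50
primer_51 = "CCC GGC TGC ATA AGC AGT CGA CTT GGA"  # SEQ ID NO:51

print(find_sites(primer_50))
print(find_sites(primer_51))
```

The first primer carries the BglII sequence and the second carries the SalI sequence, consistent with the subsequent BglII/SalI digestion of the PCR product.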

Please replace the paragraph on page 34 starting on line 4 with the following: 



This vector contains HCV sequence, which includes the wild-type full-length
HCV protease coding sequence, fused at the 5' end to a SOD coding sequence. Two
fragments, a 441 bp EcoRI/BglII fragment from clone 11b and a 1471 bp BglII/EcoRI
fragment from expression vector P500, were used to reconstruct a wild-type, full-length
HCV protease coding sequence. These two fragments were ligated together with an
EcoRI digested pS356 vector to produce an expression cassette. The expression cassette
encodes the ADH2/GAPDH hybrid yeast promoter, human SOD, the HCV protease, and
a GAPDH transcription terminator. The resulting vector was digested with BamHI and a
4052 bp fragment was isolated. This fragment was ligated to the BamHI digested pAB24
vector to produce p650. p650 expresses a polyprotein containing, from its amino terminal
end, amino acids 1-154 of hSOD, an oligopeptide -Asn-Leu-Gly-Ile-Arg- (SEQ ID
NO:87), and amino acids 819 to 1458 of HCV-1, as shown in Figure 17 of WO 90/11089,
published 4 October 1990.

Please replace the paragraph on page 34 starting on line 17 with the following:

Clone 11b was isolated from the genomic library of HCV cDNA, ATCC
accession no. 40394, as described above in Example 3A, using a hybridization probe 
having the following sequence: 

5' CAC CTA TGT TTA TAA CCA TCT CAC TCC TCT 3' (SEQ ID NO:54). 
This procedure is also described in EPO Pub. No. 318 216, Example IV.A.17. 



Please replace the paragraph on page 34 starting on line 22 through page 35, line 12 with 
the following: 

The vector pS3EF, which is a pBR322 derivative, contains the ADH2/GAPDH
hybrid yeast promoter upstream of the human superoxide dismutase gene, an adaptor, and
a downstream yeast effective transcription terminator. A similar expression vector
containing these control elements and the superoxide dismutase gene is described in
Cousens et al. (1987) Gene 61: 265, and in copending application EPO 196,056,
published October 1, 1986. pS3EF, however, differs from that in Cousens et al. in that the
heterologous proinsulin gene and the immunoglobulin hinge are deleted, and Gln154 of
SOD is followed by an adaptor sequence which contains an EcoRI site. The sequence of
the adaptor is:

5' AAT TTG GGA ATT CCA TAA TTA ATT AAG 3' (SEQ ID NO:55)
3' AC CCT TAA GGT ATT AAT TAA TTC AGCT 5' (SEQ ID NO:56)

The EcoRI site facilitates the insertion of heterologous sequences. Once inserted into
pS3EF, a SOD fusion is expressed which contains an oligopeptide that links SOD to the
heterologous sequences. pS3EF is exactly the same as pS356 except that pS356 contains
a different adaptor. The sequence of the pS356 adaptor is shown below:

5' AAT TTG GGA ATT CCA TAA TGA G 3' (SEQ ID NO:57)
3' AC CCT TAA GGT ATT ACT CAG CT 5' (SEQ ID NO:58)

pS356, ATCC accession no. 67683, is deposited as set forth below.



Please replace the paragraph on page 37 starting at line 4 with the following: 



Four synthetic DNA fragments were annealed and ligated together to create an
EcoRI/SacI Yellow Fever leader, which was ligated to an EcoRI/SacI digested pGEM®-3Z
vector from Promega®. The sequences of the four fragments are listed below:

YFK-1:
5' AAT TCG TAA ATC CTG TGT GCT AAT TGA GGT GCA TTG GTC TGC
AAA TCG AGT TGC TAG GCA ATA AAC ACA TT 3' (SEQ ID NO:59)

YFK-2:
5' TAT TGC CTA GCA ACT CGA TTT GCA GAC CAA TGC ACC TCA ATT AGC
ACA CAG GAT TTA CG 3' (SEQ ID NO:60)

YFK-3:
5' TGG ATT AAT TTT AAT CGT TCG TTG AGC GAT TAG CAG AGA ACT GAC
CAG AAC ATG TCT GAG CT 3' (SEQ ID NO:61)

YFK-4:
5' CAG ACA TGT TCT GGT CAG TTC TCT GCT AAT CGC TCA ACG AAC
GAT TAA AAT TAA TCC AAA TGT GTT 3' (SEQ ID NO:62)
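
How the first two fragments anneal can be checked by comparing YFK-1 with the reverse complement of YFK-2. The sketch below is illustrative only and covers just this one pair. The helper names `clean` and `reverse_complement` are hypothetical, and the sequences are copied from the listing above.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(seq: str) -> str:
    """Remove spaces and upper-case a sequence copied from the listing."""
    return seq.replace(" ", "").upper()


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return clean(seq).translate(COMPLEMENT)[::-1]


YFK1 = clean(
    "AAT TCG TAA ATC CTG TGT GCT AAT TGA GGT GCA TTG GTC TGC "
    "AAA TCG AGT TGC TAG GCA ATA AAC ACA TT"
)
YFK2 = clean(
    "TAT TGC CTA GCA ACT CGA TTT GCA GAC CAA TGC ACC TCA ATT AGC "
    "ACA CAG GAT TTA CG"
)

rc2 = reverse_complement(YFK2)
start = YFK1.find(rc2)                    # -1 would mean no perfect pairing
five_prime_overhang = YFK1[:start]
three_prime_tail = YFK1[start + len(rc2):]

print(start, five_prime_overhang, three_prime_tail)
```

In this check YFK-2 pairs with YFK-1 along its whole length, leaving a four-nucleotide AATT end on one side of YFK-1 and an eight-nucleotide single-stranded tail on the other.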